Quantitative analysis of mathematical documents
Internal identifier: 001380 (Main/Exploration); previous: 001379; next: 001381
Authors: S. Uchida [Japan]; A. Nomura [Japan]; Masakazu Suzuki (mathematician) [Japan]
Source:
- International journal on document analysis and recognition : (Print) [ 1433-2833 ] ; 2005.
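French descriptors
- Pascal (Inist): Analyse documentaire, Reconnaissance caractère, Reconnaissance optique caractère, Base donnée très grande, Base donnée, Analyse quantitative, Réalité terrain, Formule mathématique.
- Wicri:
- topic: Base de données, Analyse quantitative.
English descriptors
- KwdEn: Character recognition, Database, Document analysis, Ground truth, Mathematical formula, Optical character recognition, Quantitative analysis, Very large databases.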
Abstract
Mathematical documents are analyzed from several viewpoints for the development of practical OCR for mathematical and other scientific documents. Specifically, four viewpoints are quantified using a large-scale database of mathematical documents, containing 690,000 manually ground-truthed characters: (i) the number of character categories, (ii) abnormal characters (e.g., touching characters), (iii) character size variation, and (iv) the complexity of the mathematical expressions. The result of these analyses clarifies the difficulties of recognizing mathematical documents and then suggests several promising directions to overcome them.
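Affiliations: Department of Intelligent Systems, Kyushu University (S. Uchida); Department of Mathematics, Kyushu University (A. Nomura, M. Suzuki)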
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000417
- to stream PascalFrancis, to step Curation: 000370
- to stream PascalFrancis, to step Checkpoint: 000388
- to stream Main, to step Merge: 001418
- to stream Main, to step Curation: 001380
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Quantitative analysis of mathematical documents</title>
<author><name sortKey="Uchida, S" sort="Uchida, S" uniqKey="Uchida S" first="S." last="Uchida">S. Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Department of Intelligent Systems, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Nomura, A" sort="Nomura, A" uniqKey="Nomura A" first="A." last="Nomura">A. Nomura</name>
<affiliation wicri:level="4"><inist:fA14 i1="02"><s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Suzuki, M" sort="Suzuki, M" uniqKey="Suzuki M" first="M." last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4"><inist:fA14 i1="02"><s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">06-0054253</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 06-0054253 INIST</idno>
<idno type="RBID">Pascal:06-0054253</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000417</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000370</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000388</idno>
<idno type="wicri:doubleKey">1433-2833:2005:Uchida S:quantitative:analysis:of</idno>
<idno type="wicri:Area/Main/Merge">001418</idno>
<idno type="wicri:Area/Main/Curation">001380</idno>
<idno type="wicri:Area/Main/Exploration">001380</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Quantitative analysis of mathematical documents</title>
<author><name sortKey="Uchida, S" sort="Uchida, S" uniqKey="Uchida S" first="S." last="Uchida">S. Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Department of Intelligent Systems, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Nomura, A" sort="Nomura, A" uniqKey="Nomura A" first="A." last="Nomura">A. Nomura</name>
<affiliation wicri:level="4"><inist:fA14 i1="02"><s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Suzuki, M" sort="Suzuki, M" uniqKey="Suzuki M" first="M." last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4"><inist:fA14 i1="02"><s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
<imprint><date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Character recognition</term>
<term>Database</term>
<term>Document analysis</term>
<term>Ground truth</term>
<term>Mathematical formula</term>
<term>Optical character recognition</term>
<term>Quantitative analysis</term>
<term>Very large databases</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Analyse documentaire</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Base donnée très grande</term>
<term>Base donnée</term>
<term>Analyse quantitative</term>
<term>Réalité terrain</term>
<term>Formule mathématique</term>
<term>.</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Base de données</term>
<term>Analyse quantitative</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Mathematical documents are analyzed from several viewpoints for the development of practical OCR for mathematical and other scientific documents. Specifically, four viewpoints are quantified using a large-scale database of mathematical documents, containing 690,000 manually ground-truthed characters: (i) the number of character categories, (ii) abnormal characters (e.g., touching characters), (iii) character size variation, and (iv) the complexity of the mathematical expressions. The result of these analyses clarifies the difficulties of recognizing mathematical documents and then suggests several promising directions to overcome them.</div>
</front>
</TEI>
<affiliations><list><country><li>Japon</li>
</country>
<region><li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement><li>Fukuoka</li>
</settlement>
<orgName><li>Université de Kyūshū</li>
</orgName>
</list>
<tree><country name="Japon"><region name="Kyūshū"><name sortKey="Uchida, S" sort="Uchida, S" uniqKey="Uchida S" first="S." last="Uchida">S. Uchida</name>
</region>
<name sortKey="Nomura, A" sort="Nomura, A" uniqKey="Nomura A" first="A." last="Nomura">A. Nomura</name>
<name sortKey="Suzuki, M" sort="Suzuki, M" uniqKey="Suzuki M" first="M." last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001380 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001380 | SxmlIndent | more
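Outside Dilib, the record can also be read with a general-purpose XML parser. The following is a minimal sketch in Python, not part of the Dilib toolchain. It works on a simplified excerpt of the record: the `wicri:` and `inist:` prefixed elements and attributes are left out, on the assumption that their namespaces are not declared in the dump (a standard parser rejects undeclared prefixes). The function name `summarize` and the constant `RECORD` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Simplified excerpt of the record shown above (assumption: prefixed
# wicri:/inist: nodes are dropped so that the text parses as plain XML).
RECORD = """<record><TEI><teiHeader><fileDesc><titleStmt>
<title level="a">Quantitative analysis of mathematical documents</title>
<author><name first="S." last="Uchida">S. Uchida</name></author>
<author><name first="A." last="Nomura">A. Nomura</name></author>
<author><name first="M." last="Suzuki">Masakazu Suzuki</name></author>
</titleStmt></fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn">
<term>Character recognition</term>
<term>Optical character recognition</term>
</keywords></textClass></profileDesc>
</teiHeader></TEI></record>"""


def summarize(xml_text):
    """Return (title, author last names, English keywords) from a record."""
    root = ET.fromstring(xml_text)
    # Article title stored under titleStmt
    title = root.findtext(".//titleStmt/title")
    # Last names are carried by the 'last' attribute of each <name>
    authors = [n.get("last") for n in root.iterfind(".//titleStmt/author/name")]
    # English descriptors are the <term> children of the KwdEn keyword list
    keywords = [t.text for t in root.iterfind(".//keywords[@scheme='KwdEn']/term")]
    return title, authors, keywords


title, authors, keywords = summarize(RECORD)
print(title)
print(authors)
print(keywords)
```

To process the full record, the namespace prefixes would first have to be declared on the root element or stripped; which of the two is appropriate depends on how the dump was exported.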
To add a link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:06-0054253 |texte= Quantitative analysis of mathematical documents }}
This area was generated with Dilib version V0.6.32.